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Abstract 

We revisit the derivation of the density of states of sparse random matrices. 
We derive a recursion relation that allows one to compute the spectrum of the 
matrix of incidence for finite trees that determines completely the low concen- 
tration limit. Using the iterative scheme introduced by Biroli and Monasson 
[J. Phys. A 32, L255 (1999)] we find an approximate expression for the den- 
sity of states expected to hold exactly in the opposite limit of large but finite 
concentration. The combination of the two methods yields a very simple geo- 
metric interpretation of the tails of the spectrum. We test the analytic results 
with numerical simulations and we suggest an indirect numerical method to 
explore the tails of the spectrum. 

LPT-ENS 02/16, LPTHE 02/10 



1 Introduction 

The list of physical applications of random matrix theory is very long. In nuclear 
physics, the Hamiltonians of heavy nuclei are replaced by large random matrices 
and their eigenvalues are associated to the energy levels. In condensed matter the- 
ory, random matrices are used to mimic the Hamiltonians of dirty conductors in the 
study of the metal-insulator transition. Random matrices represent the interactions 
in disordered magnets as spin-glasses. These physical systems are intimately related 
to optimization problems and the theory of random matrices also plays an important 
role in computer science. Random matrices are the incidence matrices of random 
graphs and have been extensively studied in the context of graph theory. Conse- 
quently, the development of random matrix theory and, in particular, the study of 
the spectrum of random matrices has a long history [|I|, . 
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The most commonly studied random matrices are symmetric with independent 
identically distributed real elements. If the entries are taken from a Gaussian dis- 
tribution the averaged spectrum of infinite matrices obeys the Wigner semi-circle 
law ]I|, 0, |3|, IJ. For such matrices studied in the thermodynamic limit, the only 
way to modify this result is to consider the sparse case in which the mean number 
of non-zero elements on each row remains finite when the size of the matrix di- 
verges m . If the elements in the random matrix represent the exchanges of a model 
with two-body interactions, the fully connected case corresponds to a matrix with 
non-vanishing entries outside the diagonal || while the sparse limit corresponds to 
a model with finite connectivity J7|. The latter problem is much harder than the 
former. Even if replica || [J and super-symmetric f| [T(| methods have been used to 
compute the spectrum of dilute matrices, the derivation of a full analytic expression 
for it seems out of reach. A major difference between the semi-circle law and the 
density of states of dilute matrices lies in the presence of unbounded tails in the 



latter case @, II], which are absent in the thermodynamic limit of the former |I[ 



In this article we study the density of states of a sparse random matrix using a 
variety of techniques. We take special advantage of one recently proposed by Biroli 
and Monasson fl2|| . Even if not all our results are new, we believe that it is useful 
to derive them with a method that emphasizes the underlying geometric origin of 
the tails in the spectrum, namely, the fluctuations in the site connectivities. In a 
first part, we present the results stemming from cluster expansions [O], recovering 



results mentioned by Mirlin and Fyodorov |5| and Bauer and Golinelli |IIJ, and 
showing how the density of states of any finite random matrix can be obtained in an 
iterative way (Section |3|). Second, in Section f| we use the replica approach, following 
the scheme of approximation that is based on the variational method developed 
in ||12j| . By emphasizing the links between the cluster expansion and the replica 
functional variational method, we give a very simple explanation of the asymptotic 
form of the tails of the distribution derived by Rodgers and Bray ||, and we give 
the first steps of its more precise investigation. In Section |5] we present the spectra 
obtained from the numerical diagonalisation of finite matrices. In particular, we 
demonstrate the importance of finite size effects as far as the dependence on the ferro 
or anti-ferromagnetic character of the interactions is concerned. In the conclusions 
we summarize our results and we show how the study of the dynamics of dilute 



disordered spin models are related to them 14 



2 Definitions 

Let us consider an N x N symmetric matrix with real elements Jy. We constrain 
the diagonal entries to vanish Ju = and we denote its N eigenvalues. The 
elements {i < j) are independent, identically distributed random variables with 
a distribution law P(Jij). We adopt square brackets to indicate an average over this 
distribution, [ • ]. The central quantity under study is the average density of states 

(1) 



p{fi) = 



1 N 

k=l 
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in the thermodynamic limit, N — > oo. 

For most possible choices of P(Jij) p(p) is given by Wigner's semi-circle law 
except when one considers very sparse random matrices with a finite number of non 
zero elements per row in the thermodynamic limit ||. In this case, 

P(Jii) ={ 1 -Jj) *(4>) + £*W > ( 2 ) 

with p finite. 7r(Jy) is a normalized distribution with average and mean square 
deviation of order one (this ensures a sensible thermodynamic limit) which does not 
contain a delta peak around zero, p is the mean number of non zero elements per 
row, i.e. the mean connectivity of a given site. 

In the following, we concentrate on a bimodal distribution 

n(Jij) = a 5(Jij - J ) + (1 - a) 5( J y + J„) (3) 

for the non-vanishing entries in the random matrix. The parameter a controls the 
asymmetry between the "ferromagnetic" or "anti-ferromagnetic" tendency in the 
model. Since we can absorb the dependence on J with a global rescaling of the 
density of states, p(p) = (l/Jo)/5(///Jo), we set Jq = 1 henceforth. 



3 Cluster expansion 

Given a particular realization of the matrix J among the random ensemble, one can 
associate it to a graph in a very natural way. Consider N sites labeled from 1 to iV 
and draw a link between the sites i and j if the entry Jjj is non zero. The value Jjj 
of the interaction can be written next to the link. Two sites are called adjacent if 
there is a link between them, and connected if there is a path of adjacent sites from 
one to the other. A cluster is a set of connected sites, disconnected from all others. 

These definitions make the relation between random matrices and random graphs 
explicit. As a matter of fact, the study of the ensemble of such graphs (forgetting 
the values of the interactions) is nothing but the well known random graph problem. 
When p < 1 the random matrix J is so sparse that all sites belong to finite clusters. 
At p = 1 a percolation transition occurs and a giant cluster containing a finite 



fraction of the sites appears. The rest of the sites belong to finite clusters JT3 . 

In this Section we derive a cluster expansion of the density of states, i. e. a de- 
velopment in powers of the mean connectivity p of p(p). This expansion is not valid 
beyond the percolation threshold; still, some results derived here give a geometrical 
insight into the behavior of p for all p, at least in the limit |p| — > oo. 

If one orders the sites by grouping those belonging to the same cluster, the matrix 
J acquires a block-diagonal form, each cluster being associated to one block. The 
density of eigenvalues for a particular realization of the ensemble, pj(p), can then 
be written as a double sum, 

^Pj(p) = ^$>(p-p fcc ), (4) 

C k c 
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where the index C labels the clusters and kc labels the eigenvalues of each cluster. 
Under this form, pj(p) is an additive quantity over clusters and we can apply the 
technique described in [|13| to transform this expression into a simpler one. Indeed, 
clusters can be separated into ensembles with the same topology, ignoring for the 
moment the assignment of the values of the interactions. The averaging over dif- 
ferent realizations of the random matrix proceeds in two steps; one first chooses 
the topology of the cluster, with its associated probability, and, subsequently, one 
averages over the values of the interactions with the distribution 7r(Jy). For a given 
cluster, once the latter average is performed, the density of states depends only on 
its topology. (Note that one could do so for any distribution n, even if not bimodal.) 
This remark allows us to rewrite the average of the sum in Eq. (|J) in a more con- 
venient manner. If we introduce an index t that labels all possible topologies, n t (J) 
the number of t-like clusters present in a given realization of the ensemble, and p t 
the average over the distribution ir of the density of states of the t-like cluster p t , 
we obtain the following expression for the averaged density of eigenvalues: 

p^) = [pjm = Y. [ -w L ~p^)- (5) 

t 

We included the isolated sites associating them to the index t = and we denoted 



[fit] the average number of t-like clusters. As in [|13|] we introduce a function X\ that 
takes the value 1 if the site i belongs to a i-like cluster and otherwise and we call 
L t the number of sites in a t-like cluster. Then 

n l( J) = i-$>7, I W = l W ] = g, (6) 

i 

where Pt = [X^] is the probability that a given variable belong to a t-like cluster. 
Finally, 

M = E p * PtM = E t S ^ - ^ ' ( 7 ) 

t t 1 i=i 

where p\ are the eigenvalues of the t-like cluster, P t is given by Pt = p Lt ~ l e~ pLt K t . l 
and K t is a symmetry factor [13|. The overline denotes an average over the dis- 
tribution Ti, for fixed topology. Note that only tree-like clusters contribute to the 
thermodynamic limit: the probability that a site belong to a cluster containing a 
loop of finite length is of order 1/N. Reordering this series in powers of p, it is clear 
that all clusters of size L t < m + 1 have to be considered to obtain the expansion 
at order p m . 

For the bimodal ir(Jij) each type of cluster contributes a sum of a finite number 
of delta functions to p(p). The location of the delta functions are symmetrically 
distributed around p = 0. The only task remaining in order to obtain the series 
expansion is to diagonalize the finite size matrices corresponding to each cluster. 
For instance, up to first order in p, one finds (for any value of a): 

p(jj.) = 5(n) + \ [5(p - 1) - 25(p) + 5(p + 1)] . (8) 



4 




Figure 1: Two examples of finite clusters: a linear (n = 4) and a star-like one 
(* = 8). 



There does not seem to be a straightforward way to compute the eigenvalues of an 
arbitrarily shaped tree-like cluster. A possible method would be to solve the integral 
equation arising from the replica or the super-symmetric approach perturbatively 
and to expand the order parameter in powers of the concentration, as explained 
in ||. Another method, exposed below, is to establish recursion relations on the 
characteristic polynomial of finite size matrices [TJj . 



Before getting into the general derivation, let us study two examples of simple 
clusters for which one can compute the eigenvalues explicitly. Consider first a linear 
cluster with n sites (see the left panel of Fig. |l|). For this structure the corresponding 
block in the matrix J has elements ±1 on the two diagonals next to the main one, 
and everywhere else. It is then a simple exercise of linear algebra to show that the 
n eigenvalues are 

In 



Hi = 2 cos y—p[) > l = l,...,n. (9) 

Note that for any value of n, the eigenvalues belong to [—2, 2}. The "opposite" kind 
of geometry is given by clusters with one site connected to k neighbours (see the 
right panel in Fig [I]). Again the matrix J is readily written. One easily shows that 
the spectrum of such a cluster has k — 1 vanishing eigenvalues, one eigenvalue given 
by \fk and another one given by —yk. This behavior is very different from the 
one of the linear clusters: more and more branched clusters yield larger and larger 
eigenvalues. Very important for the arguments developed in Section ^ is the fact 
that the eigenvectors associated with the eigenvalues ±yh have a larger magnitude 
on the central site than on its neighbours, with a ratio yk. Thus, for large k, the 
eigenvectors are more and more concentrated on the central site. We shall come 
back to this point later, when discussing the behavior of the density of states at 
large eigenvalues. 

To derive the general recursion relation we focus on tree-like structures since 
loops of finite size disappear in the thermodynamic limit. We define a rooted tree 
r as a tree with one of its sites (the root) particularized. We represent a generic 
rooted tree as a bubble englobing the tree, with only the root shown (see Fig. |2[). If 
the root of the tree r is connected to, say, k other sites that we call its neighbours, 
we denote r i; i — 1, k, the trees whose roots are the neighbours of the original 
root and 6j the values of the edges that link the original root to the k secondary 
ones (see Fig. H). 

We call P r (X) the characteristic polynomial of the matrix associated with the 
rooted tree r, and P r (X) the one of the matrix in blocks associated to the discon- 
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Figure 2: An example of rooted tree 

x n W2 ^r k 





Figure 3: Decomposition of a rooted tree 



nected trees obtained after deleting the root of r and the edges emerging from it. 
We thus have 



p,(A)=n^(A) 



(10) 



1=1 



With these definitions the recursion relation is obtained by expanding the determi- 
nant along the rows and the columns of the roots, and reads 



p r (\) = -An^(A) - j2^ p nwu p rM) ■ 



IF 



1=1 



Our first remark is that the parameters e« appear only through their squares, so we 
can prove by recurrence that for any tree, the average spectrum for the bimodal 
distribution (||) does not depend on the value of a. Thus, when p < 1 and the 
thermodynamic limit is taken p(p) is totally independent of the ferromagnetic or 
anti-ferromagnetic character of the distribution. When p > 1, loops of length of 
order IniV appear in the giant cluster and the argument above, relying on the tree 
structure of the interactions, is not valid anymore. We shall develop this point in 
Section 

As a simple example, let us now write down the characteristic polynomial for the 
two level tree sketched in Fig. The root has k neighbours and each of them has 
k + 1 neighbours, i = 1, k. If we define L = Ylih: the characteristic polynomial 
is given by 



P(A) = (-A) 



L-fc+l 



(12) 
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whose roots give the L + k + 1 eigenvalues of the corresponding matrix. For instance 
in the symmetric case (I4 = I Vi), the eigenvalues (and their degeneracies) are: 
(k(l — 1) + 1) , ±v^ (k — 1 for each sign) and ±y/k + I (1 for each sign). 

In principle, the relation (|TTD can be iterated to obtain the characteristic poly- 
nomial of any tree, and from it its eigenvalues. 




Figure 4: Two level tree 



To conclude this Section, let us note than even for p > 1, a finite fraction of 
sites belong to finite-size clusters. Thus, the spectra contains delta peaks located at 
eigenvalues of finite tree clusters for all p, as established in [D], [H| (it is also shown 
in 



11] that parts of the giant cluster contribute to these delta peaks) 



In the context of the gelation transition, a detailed study of the low concentration 
phase for related matrices was performed in 



4 Replica approach 

In this Section we compute the spectrum using the replica method. We first give a 
qualitative argument to suggest where one can expect to find localized states. Then 
we describe the method and find a series of approximate expressions for p(/i). 

4.1 Localized eigenvectors on the Bethe lattice 

One cannot use the above cluster expansion to derive the density of states above 
the percolation transition p = 1 where a giant component with a finite fraction of 
the sites appears in the thermodynamic limit. The giant cluster contains loops with 
length O(lniV) so, locally, it has a tree-like structure . 

It is interesting to know if some eigenvectors of the sparse matrix are localized, 
i.e. if they have a finite fraction of their weight on a finite number of sites. A 
qualitative answer to this question can be given by assuming that one site has 
k neighbours, and that all the other sites have the same connectivity p + 1 (the 
average connectivity of a site having already one neighboor is p + 1), with no loops. 
That is to say, we approximate the giant component with a Bethe lattice with a 
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single "defect", the site with a connectivity that is different from the mean. It is 
then easy to derive a condition for the existence of an eigenvector localized on this 
defect and with a spherical symmetry i.e. such that its magnitude on each site 
only depends on the distance to the central site. For the ensemble of matrices under 
consideration, one finds the condition: k > 2p. Thus, we expect localized eigenstates 
to appear around sites with connectivities much larger than the mean. Note that 
for the Laplacian of a random graph (instead of the incidence matrix considered 
here) the same argument leads to \k — p\ > ^Jp [12] for the localization condition 
of eigenstates; the required deviation of the local connectivity from the mean is less 
stringent in this case. 

This argument can be refined by considering a site with k neighbours, each having 
I + 1 neighbours, and all other sites having the mean connectivity p + 1. Then the 
condition for the existence of a localized eigenstate is : k + 21 > 4p. Of course if one 
takes / = p, the above condition is recovered. If I is slightly lower than the mean, 
the condition on k becomes more and more demanding. Conversely, if / is slightly 
higher than the mean, the condition on k is relaxed. One can expect the latter 
case to be privileged in random graphs for which the distribution of connectivities 
is Poissonian: sites with connectivity of order twice the mean are rare, but their 
neighbours are likely to have a connectivity slightly above the mean. 



4.2 Method and definitions 

Field theories, either replicated || or super-symmetric 0, |J, have been used to de- 
termine the average spectrum of random matrices. For sparse matrices, the result 
is expressed in terms of the solution to an integral equation, which has proven too 
difficult to be solved exactly. Approximate results, either perturbative or non per- 
turbative || with respect to the inverse concentration 1/p have been obtained. We 
shall revisit this problem using the replica approach and the scheme of approxima- 
tion developed by Biroli and Monasson [12] (see also [19] for a similar scheme). This 



method will allow us to unveil the geometric origin of the tails in the distribution 
and to compare with results established above for finite size clusters. For simplicity, 
we focus on the symmetric bimodal distribution introduced in (|3]) with a = 1/2. We 
come back to the dependence on a in Section [4.6| . 

The normalized density of states for a particular realization of disorder can be 
computed in terms of the resolvent [|TJ , 



Ntt 



Im Tr [Jij — ((/, + ie)5,, 



i-i 



'j J 



2 d 
— — Im— In Z(u) 



(13) 



fi+ie 



with e > and infinitesimal, and 



z(jA 



n 



exp 




(14) 



The average over disorder, [pj(/i)], is evaluated by introducing replicas to average 
the logarithm, lim n ^ hi[Z n ] = nflnZ]. A very useful way to study the mean field 
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theory of dilute systems |^D| relies on introducing a function c(0) that measures the 
fraction of sites with a field fa equal to a chosen value 0, the vector denoting the 
n-dimensional replica space, 



(15) 



Thus, 



with 



[z n ] 

5 EFP (c(0)) 



p(/i) = lim 



2 (9 
. — Im— In [Z n ] 



(16) 



Vc{4>) exp(-NS E¥F (c(<j>))) , 



d(f)c((f)) lnc(0) 



2 /i 



+ tf EFF ( C (0)) , 



(17) 



where the integration is taken only over normalized c(0). The effective Hamiltonian 
is defined as 

i 
2 



exp (-NH m 



exp 



'J 



J 



and for the ensemble of symmetric sparse random matrices with a bimodal symmetric 
distribution it reads 



H E „{c{<f>)) = - 



p p 



d<fi dip c(4>) c(ip) cos(0 • ip) . 



(19) 



In the limit of large sizes, iV — > oo, the path integral in Eq. (^) can be computed 
with the saddle-point method and it is dominated by the neighbourhood of the 
extreme, c sp (0), of the effective action S EFF , which satisfies: 



c{4>) = Af exp 



% -»o 5H EFF 

-ud> — 

2 8c{(t>) 



(20) 



with M a normalization constant. 

Then, the averaged density of states reads 



pU) = lim — Re / d(p c sp (0) <p 2 

n^O Tin 



(21) 



Even with the assumption of invariance of c(<ft) with respect to rotations in the 
n-dimensional replica space, it has proven too difficult to solve the saddle point 
equation explicitly. A numerical solution of a similar equation was given in the 
context of the study of the instantaneous normal modes in a liquid. 
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4.3 Effective medium approximation 

The first step of the approximation scheme [12| consists in solving the problem 
variationally, i.e. by restricting the original problem of extremization to the partic- 
ular Gaussian subspace of the functions c(4>). This first step is called the "effective 
medium approximation" (ema) since it amounts to assuming that all the sites are 
equivalent and play the same role. Thus one replaces c(0) with a Gaussian Ansatz 



c EMA (0) = (2ma^))- n / 2 exp ^"^yj > (22) 

with the variational function determined by the stationarity condition 

SS ETF /Sa(ii) = 0. After some algebra, we find that a = a(fi) is given by the cu- 
bic equation 

a 3 + V -^a 2 - a + - = . (23) 

Among the three roots of the equation, one has to choose the one with 
Im cr(yU + ie) < for the integrals to converge. The average density of states reads 
p EMA (/i) = —1/n Imcr(/i + ie). It is easy to check that these equations yield the 
correct Wigner semi-circle law in the fully connected limit, when J — > Jq/Vn and 
p — > N. The ema is exact in this case. 

In the following we focus on the case p > 1. For p < 1, the sites belong to finite 
size clusters and they are very heterogeneous in nature so we do not expect the ema 
to yield accurate results. Moreover the cluster expansion is exact in this regime and 
it is enough to solve the problem completely. The approach used here should be 
valid for p large but finite, as the larger the value of p, the smaller the fraction of 
sites in finite size clusters. 

Solving the cubic equation, one finds that for < A c the density of states is 
given by the continuous function 



V3 

2tt \| V 3/i J 6/i ^ Y 27/x 



p-iy P + 2 + /(A2- /U 2 ) ^2 + a 2 ) 



^1 3 _ - l±l _ / (Ag-/i 2 )(/i 2 + a 2 ) 

2vr \ V 3p J 6p y 27/i 4 



(24) 



and vanishes outside the band [— A c , A c ], with 



\3 



,-p 2 + 20p + 8+^p(p + 8) 3 2 p 2 -20p-8+ y/p(p + 8) 
A c =y , a = , (25) 

see the solid curve in Fig. |^. The density of states in the ema vanishes at ±A C as a 
square root, p EUA (fi) ~ (A c — l/ij) 1 / 2 , just as for a semi-circle law. Expanding Eq. (|24]) 
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in powers of 1/p one recovers the first orders of the 1/p expansion of Rodgers and 
Brayi: 



A, 



2VP 



7rA : 



:VA? 



1 



1 

P 



1 



A? 



(26) 



(27) 



It would be interesting to check if the ema and the perturbative solution of these 
authors lead to the same density of states at all orders in the expansion in powers of 
1/p. If this were true, the ema would be the exact resummation of the perturbative 
solution in M. 

Note that at this level of the approximation scheme there are no states outside the 
band [A c , A c ], which is consistent with the qualitative argument given in Section |TH] : 
localized states with high eigenvalues are due to fluctuations of the local connectivity, 
whereas, by definition, the ema assumes a uniform local connectivity. 



c (0) = v e^ 2 ( ~Jf- / d ^ c $) cos (<? • 



4.4 Single defect approximation 

To go beyond the ema, it is useful to write the saddle point equation ( PUD on c(0) 
in the following form 

k 

(2* 

k=0 ' *-J 

with r\ a normalization constant tending to 1 as n — > 0. This equation has the 
following geometric interpretation: any chosen site has k neighbours with probability 
exp(— p)p k /k\ and each of them interact with the central site through the effective 
Hamiltonian. The self consistent equation is obtained by imposing the equality of 
the distribution of effective fields c((f>) for all sites. 

In the single defect approximation, one uses the above saddle-point equation in 
an iterative form, i.e one inserts the result of the Gaussian approximation in the 
right-hand side of Eq. ([2~8|). This means that one allows the connectivity of a given 
site to fluctuate but its neighbours are treated as part of the effective medium. The 
procedure yields 



k=0 v 7 



^> = >,-^|-r) % + fc - to0 , + fc) - < 29 > 

For fi G [— A C ,A C ], the SDA density of states is different from the ema one; yet 
it still vanishes at ±A C as a square root. In Fig. [| we show the central band of the 
density of states in the ema and the SDA for p = 10; for comparison, we include 
a semi-circle with the same support. It is clear from the figure that the main 
modification introduced by the SDA is concentrated around \i ~ 0. 

Outside the band, delta peaks appear when the denominator of the above ex- 
pression vanishes. They are located at ±/ifc, with 

o-(jtijfc) = , W\ > A c , (30) 
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0.12 



3 




Figure 5: Extended part of the density of states for p = 10, in the ema and sda 
(A c ~ 6.60). A semi-circle of same width is shown for comparison. 



and the weight of the peak at p — ±pk is 



e p p k 



k\ 1 — ka'(fii> 



(31) 



As a is a real uneven decreasing function of p for \p\ > A c , there is a minimum 
value of k for which Eq. fl3!]| ) has a solution. A sufficiently large fluctuation of 
the local connectivity is necessary for localization, as predicted by the qualitative 
argument given in Section |4.1| . For instance, for p = 10, the first peak is found for 
kmin = 24. One can check that k m i n ~ 2p when p — > oo and the condition derived 



by the qualitative argument of Section [O] is recovered in this limit. 

Since the peaks extend all the way up to \p\ — > oo (as k grows, the peaks move 
away from A c ) we can now work out the asymptotic behaviour of the density of 
states. Far from the continuous band, i.e. when k — ■> oo and consequently \p\ — > oo, 
we have <r(p) ~ The peaks are thus located at p^ ~ ±^/k and their weight 

is exp(— p)p k /(2 k\) for both signs. In this limit, the distance between the peaks 
decreases, since (Vk- Vk~^l) ~ l/(2y/k) -> and one can propose a continuous 
approximation to describe the tails of the distribution: 



1 e~ p p 



2 k\ 



(32) 



After expanding for large k and changing variables k — > p 2 , one obtains the asymp- 
totic form 



P SD » 



2n 



exp 



-p 2 ln 



p_ 

pe 



2\ i 



fJL > A c 



(33) 



This result was obtained by Rodgers and Bray in ||, after an involved non pertur- 
bative treatment of an integral equation following a method proposed in [ 22 ]. 
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Let us emphasize the very simple geometric interpretation of this asymptotic 
form unveiled by the present derivation. A given site in a random graph has k 
neighbours with probability exp(— p)p k /k\. For large k, there are two eigenvectors 
localized on it, with eigenvalues ±y/k. It is then not surprising that in the limit 
of large k, one finds the same eigenvalues as for a star-like finite cluster. When k 
grows, the eigenvector is more and more concentrated on the central site and the 
eigenvalue is less and less sensitive to the environment of the first neighbours; if they 
are disconnected from the rest of the sites, as for finite clusters, or inside the giant 
cluster is not important in this limit [^3] . 



We would like to stress a similarity with a result in [JO]. These authors ex- 
press the moments of p(p) as polynomials in p and they provide a bound for 
the coefficients of these polynomials. Their lower bound is expressed in terms 
of Stirling numbers and polynomials that are the moments of the distribution 

f p (x) = ^ ^2T=o fcr(^( x — ^0 + <K X + y/k)) that we argued to be the asymp- 
totic form of the eigenvalue spectrum. Moreover, Stirling numbers also arise when 
considering star-like geometries via the enumeration of particular walks on these 
trees. There should be a deep connection between these two approaches beyond this 
formal similarity. 



4.5 Two level approximation 

Let us now briefly explain how to improve on the above results. The next step in 
the approximation scheme consists in iterating the saddle point equation once more, 
by inserting c SDA (0) in the right hand side of Eq. (p8|) . The spectrum in the interval 
[— A c , A c ] is again modified with respect to the ones found in the ema and SDA. 
However, it still vanishes at ±A C and keeps a more or less similar form. Outside this 
band, peaks indexed by k,l\, ... ,lk , with k and U integers, appear for values of \x 
that are solutions to the equation: 

k 

l = Y,-^ T\ with H>V (34) 

This corresponds to considering a central site having k neighbours, each having 
U + 1 neighbours, the latter being treated as part of the effective medium. One has 
thus a more precise description of the environment of the given site since its first 
neighbours are treated exactly. Again, if one considers the limit \p\ — > oo, for which 
<t(/z) ~ Eq. ( |3^| ) yields the roots of the characteristic polynomial of a finite 
cluster with the same geometry, see Eq. (|12|). In this limit, the part of the graph 
under consideration is not sensitive to the rest of the sites. 

For the same reasons explained at the SDA level, there are minimum values of k, 
{k}, for a solution to Eq. ( |34l) to exist. As predicted by the qualitative argument, 
there can be localized states with k less than k min of the SDA, provided the Z^s are 
not too small. For instance, if p = 10, there are solutions with k = 20 and all the 
liS equal to 13. 

Even if we have not derived an explicit asymptotic expression of the density of 
state at this level of approximation, we believe that the qualitative features of p SDA (^) 
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will not be modified. Indeed, large eigenvalues are due to sites having a connectivity 
much larger than the mean and these are rare events. Configurations in which the 
connectivity of the first neighbours also differs considerably from the one of the 
effective medium have even lower probability, they should then be negligible and 
yield only a small correction. Yet, it would be interesting to evaluate it and to check 
if it is consistent with the next terms of the expansion which, in principle, can be 
computed from the non perturbative method in ||. 



4.6 Asymmetric distribution 

In this section we discuss the influence of the asymmetry parameter a that quanti- 
fies the ferromagnetic or anti-ferromagnetic character of the interactions, see Eq. (|3[). 

We have shown rigorously that as long as the underlying graph is tree-like, the 
parameter a has strictly no influence on the spectrum. Thus, when p < 1, in the 
thermodynamic limit, p(p) is strictly independent on a. 

When p > 1, one can show that a has no effect on the iterative scheme of reso- 
lution of the saddle point equation. Indeed, for a ^ 1/2, the effective Hamiltonians 
becomes: 



- H m (c($)) = -| + | y d$d$ c{$) c$) (a e-*W + (1 - a) e^) . (35) 

Since c EMA (0) is even with respect to </> — > — 0, Eq. ( p3|) on er(/x) is not modified. 
Then c SDA ((/>) is independent on a and even so, at all orders of the iteration, the order 
parameter, and thus the spectrum, do not depend on a. 



The independence of p(p) on a was conjectured in |10| and shown rigorously 

in 



|Tl| : the moments of the spectrum do not depend on a, and their growth is suffi- 
ciently slow to determine the density of states. However, we observed numerically a 
strong dependence on a in finite size matrices (see Section^). As practical applica- 
tions of random matrix theory are often confined to small matrix sizes, this cannot 
be neglected. 



5 Numerical results 

In this Section we present the spectra obtained from the exact numerical diagonali- 
sation of finite size random matrices. 

In Fig |] we plot the average density of states, computed on 500 samples of 
2000 x 2000 matrices, with mean connectivity p = 10 and a symmetric distribution 
of the matrix elements, a = 1/2. The agreement with the SDA prediction is very good 
in most of the spectrum. However, there is a tail extending beyond A c , shown in the 
inset. We have checked that this tail was not due to finite-size effects by repeating 
the diagonalization for different values of N. We have also checked that eigenvalues 
in the tail are due to localized states, by computing the inverse participation ratio 
for these states, which does not scale as 1/N in the large N limit. However, the 
height of the peaks predicted within the SDA is much too small to explain this tail 
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quantitatively. We believe that successive iterations of the approximation scheme 
are necessary to describe it, by taking into account the "cooperative effect" of neigh- 
bours of a site of large connectivity having connectivities slightly higher than the 
mean. As argued in Section |] we expect the SDA to be correct at large eigenvalues. 
However, the weight in the asymptotic tail is so small that it would require very 
large matrix sizes and a very large number of samples to be observed. Because of 
the high computational cost of matrix diagonalization, it seems impossible to study 
this phenomenon directly. In the conclusion we propose an alternative method to 
analyse it. 

We have also studied numerically the influence of the asymmetry parameter a 
on the average density of states. In Fig. |7| we plot the average density of states, 
computed on 2000 samples of 800 x 800 matrices, with mean connectivity p = 10 
and a purely ferromagnetic distribution, a — 1. We find a very similar spectrum in 
the region [— A c , A c ] with the addition of a separated bump at larger eigenvalues. The 
total weight in the bump is, up to numerical precision, 1/N (we have checked this 
scaling for different values of N), suggesting that for each realization of the matrix, 
the largest eigenvalue is in the bump. This assumption was confirmed directly, by 
treating separately the largest eigenvalue of each sample. 

For N = 800 and p — 10 we found that the location of the largest eigenvalue 
decreases smoothly from a = ltoa~0.65, where it approximately approaches 
the edge of the a = 0.5 distribution. This observation is not contradictory with 
the previous statements on the independence on a of the density of states in the 
thermodynamic limit, as this effect is of order 1/N. A detailed study of the N 
and a dependence is delicate and it goes beyond the scope of this work. Yet, it is 
important to keep in mind this effect for realistic implementations where finite size 
effects are important. 



6 Conclusions 

The study of random matrices is a very rich field. For sparse matrices, there has 
been interest in different ensembles (Laplacians || [12|, 21 1, incidence matrices ||, 



"real- world" graphs f24}| , banded matrices |[25||), and in different properties (large 



eigenvalue tails, behaviour around small eigenvalues, quantum/classical percola- 
tion p6| ). In this paper we focused on the specific problem of the large eigenvalue 
tails. We studied the properties of finite-size clusters and derived recursion relations 
on characteristic polynomials. This allows one to solve exactly the problem for low 
concentrations p. In addition we used a replica variational method |12| to study the 
large p limit. The combination of these two methods gives a very simple geomet- 
ric content to the asymptotic form of the eigenvalue tails previously obtained with 
involved calculus ||. It is worth mentioning though a drawback of the method we 
used: it is not straightforward to estimate the range of validity of the result and to 
bound the corrections that successive levels of iterations generate. 

As mentioned in Section [|, the direct exploration of the tails in the eigenvalue 
spectrum by means of numerical diagonalisation is a very difficult task, as these tails 
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Figure 6: Density of states averaged over 500 samples for p — 10, a = 1/2, the 
matrix size is N = 2000. The dashed curve represents the SDA result for this value 
of p and it is almost totally superposed to the numerical data apart from a small 
deviation near the edges, shown in the inset. 



are due to very rare events. In a forthcoming paper [TJJ] we shall use an indirect 
method that might make this study easier. Namely, we shall consider the dynamics 
of a spherical model with finite connectivity interactions, whose dynamics is directly 
related to the eigenvalue spectrum considered here. The dynamics in the long time 
limit is controled by the large eigenvalue behaviour of the density of states and its 
study will hence give us information about the tails of the spectrum itself. 
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Figure 7: Density of states averaged over 2000 samples for p — 10, a — 1 and a 
matrix size N = 800, with a dashed curve the SDA result for this value of p. Note 
the ferromagnetic bump at fj, ~ 11. 
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